Audio Feature Space Analysis for Emotion Recognition from Spoken Sentences

نویسندگان

چکیده

An analysis of low-level feature space for emotion recognition from the speech is presented. The main goal was to determine how statistical properties computed contours features influence signals. We have conducted several experiments reduce and tune our initial set configure classification stage. In process audio space, we employed univariate selection using chi-squared test. Then, in first stage classification, a default parameters selected every classifier. For classifier that obtained best results with settings, hyperparameter tuning cross-validation exploited. result, compared two different languages find out difference between emotional states expressed spoken sentences. show an containing 3198 attributes dimensionality reduction about 80% algorithm. most dominant at this based on mel bark frequency scales filterbanks its variability described mainly by variance, median absolute deviation standard average deviations. Finally, accuracy tuned SVM equal 72.5% 88.27% sentences Polish German languages, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Recognition System For Spoken Japanese Sentences

A speech recognition system for continuously spoken Japanese simple sentences is described. The acoustic analyser based on a psychological assumption for phoneme identification can represent the speech sound by a phoneme string in an expanded sense which contains acoustic features such as buzz and silence as well as ordinary phonemes. Each item of the word dictionary is written in Roman letters...

متن کامل

Noise Analysis in Audio-Visual Emotion Recognition

This paper describes the use of a decision-based fusion framework to infer emotion from audiovisual feeds, and investigates the effect of noise on the fusion system. Facial expression features are constructed from linear binary patterns, and are processed independently of the prosodic features. A linear support vector machine is used for the fusion of the two channels. The results show that the...

متن کامل

Low-dimensional feature space derivation for emotion recognition

An objective of the paper was to determine a set of lowdimensional feature spaces that provide high emotion recognition rates. Candidates for target feature spaces were randomly drawn from a broad pool of speech signal parameters that comprised both commonly used characteristics and newly introduced features. As a result, several four-dimensional feature spaces that provide the highest emotion ...

متن کامل

Extracting GFCC Features for Emotion Recognition from Audio Speech Signals

A major challenge for automatic speech recognition (ASR) relates to significant performance reduction in noisy environments. This paper presents our implementation of the Gammatone frequency cepstral coefficients (GFCCs) filter-based feature along with BPNN and the experimental results on English speech data. By some thorough designs, we obtained significant performance gains with the new featu...

متن کامل

Extracting MFCC Features For Emotion Recognition From Audio Speech Signals

A major challenge for automatic speech recognition (ASR) relates to significant performance reduction in noisy environments. Recent research has shown that auditory features based on Gammatone filters are promising to improve robustness of ASR systems against noise, though the research is far from extensive and generalizability of the new features is unknown. This paper presents our implementat...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Archives of Acoustics

سال: 2023

ISSN: ['2300-262X', '0137-5075']

DOI: https://doi.org/10.24425/aoa.2021.136581